PRECISE TAIL ASYMPTOTICS FOR ATTRACTING FIXED POINTS OF
MULTIVARIATE SMOOTHING TRANSFORMATIONS - COMPLETE PROOF 

DARIUSZ BURACZEWSKI*, SEBASTIAN MENTEMEIER†


Abstract. Given $d \ge 1$, let $(A_i)_{i \ge 1}$ be a sequence of random $d \times d$ real matrices and $Q$ be a random vector in $\mathbb{R}^d$. We consider fixed points of multivariate smoothing transforms, i.e. random variables $X \in \mathbb{R}^d$ satisfying

$X$ has the same law as $\sum_{i \ge 1} A_i X_i + Q$,

where $(X_i)_{i \ge 1}$ are i.i.d. copies of $X$ and independent of $(Q, (A_i)_{i \ge 1})$. The existence of fixed points that can attract point masses can be shown by means of contraction arguments. Let $X$ be such a fixed point. Assuming that the action of the matrices is expanding as well as contracting with positive probability, it was shown in a number of papers that there is $\beta > 0$ with $\lim_{t \to \infty} t^\beta\, \mathbb{P}(\langle u, X \rangle > t) = K \cdot f(u)$, where $u$ denotes an arbitrary element of the unit sphere, $f$ a positive function and $K \ge 0$. However, in many cases it was not established that $K$ is indeed positive.

In this paper, under quite general assumptions, we prove that

$\liminf_{t \to \infty} t^\beta\, \mathbb{P}(\langle u, X \rangle > t) > 0$,

completing, in particular, the results of Mirek (2013) and Buraczewski et al. (2013).


1. Introduction 


1.1. The (multivariate) smoothing transform. Let $d \ge 1$. Let $(Q, (A_i)_{i \ge 1})$ be a random element of $\mathbb{R}^d \times M(d \times d, \mathbb{R})^{\mathbb{N}}$, that is $Q$ is a random vector and $(A_i)_{i \ge 1}$ is a sequence of random matrices. We assume that the random number $N := \max\{i : A_i \ne 0\}$ is finite a.s. If $X \in \mathbb{R}^d$ is a random variable such that

(1.1)    $X$ has the same law as $\sum_{i=1}^{N} A_i X_i + Q$,

where $(X_i)_{i \ge 1}$ are i.i.d. copies of $X$ and independent of $(Q, (A_i)_{i \ge 1})$, then we call the law $\mathcal{L}(X)$ of $X$ a fixed point of the (multivariate, if $d > 1$) smoothing transform. By a slight abuse of notation, we also call $X$ itself a fixed point.

Eq. (1.1) has drawn a lot of attention for decades. In the univariate case this equation occurs in various areas, e.g. the analysis of recursive algorithms (Rösler (1991, 2001); Neininger and Rüschendorf (2004)), branching particle systems (Durrett and Liggett (1983)), Google's PageRank algorithm (Jelenković and Olvera-Cravioto (2012b,a); Chen et al. (2014)). Also the multivariate situation draws a lot of attention. A classical example where (1.1) appears is the joint distribution of key comparisons and key exchanges for Quicksort (Neininger and Rüschendorf (2004)). However, in this case the action of the matrices is purely contracting, and therefore all fixed points have exponential moments, which is not in the scope of the present paper. Some recent examples are related to kinetic models, see Bassetti and Matthes (2014). Then solutions to (1.1) describe e.g. the equilibrium distribution of the particle velocity in a Maxwell gas.

The aim of this paper is to describe the tail behavior of fixed points, i.e. the decay rate of

$\mathbb{P}(|X| > t)$ or $\mathbb{P}(\langle u, X \rangle > t)$,

as $t$ goes to infinity, where $u$ denotes an arbitrary element of the unit sphere $\mathbb{S}$. Our focus is on the multivariate case, where we resolve the open question whether the limit $\lim_{t \to \infty} t^\beta\, \mathbb{P}(|X| > t)$ is indeed positive.


1.2. Univariate smoothing transform. In dimension $d = 1$, complete results about the structure of fixed points are available under very weak assumptions, see Durrett and Liggett (1983); Liu (1998); Biggins and Kyprianou (1997); Alsmeyer et al. (2012); Alsmeyer and Meiners (2013); Buraczewski and Kolesko (2014) for the case of $A_i \ge 0$, and Iksanov and Meiners (2015) for the most general case of $A_i \in \mathbb{R}$. It turns out that the characterization depends on the function

(1.2)    $m(s) := \mathbb{E} \sum_{i=1}^{N} |A_i|^s,$


which is log-convex, and in particular on the value $\alpha = \inf\{s > 0 : m(s) = 1\}$. It is shown that there are two classes of fixed points: Fixed points are either mixtures of $\alpha$-stable laws and attract (only) laws with $\alpha$-regular varying tails, or have a finite moment of order $\alpha + \varepsilon$ for some $\varepsilon > 0$ (subject to the assumption $\mathbb{E}|Q|^{\alpha+\varepsilon} < \infty$) and attract point masses. The "relevant" solutions in most situations are those from the second class, which we call attracting fixed points in the sequel, and it is therefore important to investigate their properties, such as tail behavior. Under various assumptions on $(Q, (A_i)_{i \ge 1})$ and $N$, it has been shown in Guivarc'h (1990); Liu (2000); Jelenković and Olvera-Cravioto (2012b,a) that if, roughly speaking, there is $\beta > \alpha$ with $m(\beta) = 1$ and $\mathbb{E}|Q|^\beta < \infty$ (including the case $Q = 0$), then

(1.3)    $\lim_{t \to \infty} t^\beta\, \mathbb{P}(|X| > t) = K \ge 0.$
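As a purely illustrative sketch (not taken from the paper), the two solutions $\alpha < \beta$ of $m(s) = 1$ can be located numerically in a toy univariate model. The model is an assumption made here for the example only: $N = 2$ is fixed and $|A_1|, |A_2|$ are i.i.d. lognormal, $|A_i| = \exp(\mu + \sigma G)$ with $G$ standard normal, so that $m(s) = 2 \exp(s\mu + s^2\sigma^2/2)$ in closed form. The helper names `m` and `bisect_root` are hypothetical.

```python
import math

def m(s, mu=-1.5, sigma=1.0, n_children=2):
    """m(s) = E sum_{i<=N} |A_i|^s for N = n_children i.i.d. lognormal weights (toy model)."""
    return n_children * math.exp(s * mu + 0.5 * (s * sigma) ** 2)

def bisect_root(f, lo, hi, tol=1e-12):
    """Root of f on [lo, hi] by bisection; assumes f(lo) and f(hi) have opposite signs."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# log m is convex with its minimum at s* = -mu / sigma^2 = 1.5 for the default parameters;
# since m(0) = 2 > 1 > m(s*), one root lies on each side of s*.
s_star = 1.5
alpha = bisect_root(lambda s: m(s) - 1.0, 0.0, s_star)
beta = bisect_root(lambda s: m(s) - 1.0, s_star, 10.0)
```

In this toy model the roots are also available in closed form, $1.5 \pm \sqrt{2.25 - 2\log 2}$, which gives a simple check of the bisection.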


There is also a rich literature concerning the case when $\alpha$ is the unique point such that $m(\alpha) = 1$, that is when $m'(\alpha) = 0$ (see Durrett and Liggett (1983); Liu (1998); Biggins and Kyprianou (2005); Buraczewski (2009); Buraczewski and Kolesko (2014)).

It is obviously a very important question whether $K$ is indeed positive, since otherwise $t^{-\beta}$ might not be the precise rate. For $A_i \ge 0$ and $Q = 0$, positivity of $K$ is proved in Guivarc'h (1990); Liu (2000), but it remained - except for some special cases - an open question in Jelenković and Olvera-Cravioto (2012b,a), where the cases $Q \ne 0$ resp. $A_i \in \mathbb{R}$ were considered. This question was answered in the consecutive papers Alsmeyer et al. (2013) by using complex function arguments, Jelenković and Olvera-Cravioto (2014) (only for the inhomogeneous case) and in Buraczewski et al. (2015) by using large deviation estimates (but for i.i.d. $(A_i)$ only).


1.3. What is this paper about? In the multivariate setting $d > 1$, an analogue of the function $m$ can be defined (details given below), and in the case where the $A_i$, $Q$ and $X$ all have nonnegative entries, it has been shown in Mentemeier (2016) that again fixed points are either mixtures of multivariate $\alpha$-stable laws (with $\alpha$ defined as before), or have a finite moment of order $\alpha + \varepsilon$, if $\mathbb{E}|Q|^{\alpha+\varepsilon} < \infty$.

Tail behavior of attracting fixed points in this setting has been analyzed for the case $\alpha = 1$ and $Q = 0$ in Buraczewski et al. (2014), where it has been shown that

(1.4)    $\lim_{t \to \infty} t^\beta\, \mathbb{P}(\langle u, X \rangle > t) = K r(u)$

with a positive continuous function $r$ on $\mathbb{S}_\ge := \mathbb{S} \cap [0,\infty)^d$ for $\beta$ being the unique value such that $\beta > \alpha$ and $m(\beta) = 1$. In this case, also positivity of $K$ has been proved.


The inhomogeneous case $Q \ne 0$, with $\alpha < 1/2$, has been studied in Mirek (2013) and the existence of the limit in Eq. (1.4) is proved there, but it remained an open question whether $K$ is positive (at least for $\beta < 1$).

The case of invertible matrices $(A_i)_{i \ge 1}$ was studied in Bassetti and Matthes (2014) and Buraczewski et al. (2013), with tail behavior being studied mainly in the latter paper. There once more existence of the limit in Eq. (1.4) was proved, but not the positivity of $K$.

The contribution of this paper is to prove the positivity of $K$ in all these cases. We will follow the strategy developed in Buraczewski et al. (2015), getting rid at the same time of some of its restrictions. We use this approach since we were unable to find an extension of the method used in Alsmeyer et al. (2013) to the multivariate setting, and since the proof in Jelenković and Olvera-Cravioto (2014) relies on the assumption $\mathbb{P}(|Q| > 0) > 0$.

The main technical ingredient is a large deviation result for products of random matrices, which was developed in Buraczewski and Mentemeier (2015) and provided for several classes of random matrices. In the next section we introduce the three classes of matrices which are considered in this paper and further notation relevant for the multivariate case. The main results of this paper are formulated in Section 3. The remaining sections are devoted to the proof.


2. Notations 

In this section, we describe, in an abbreviated form, but similar to Buraczewski and Mentemeier (2015), three sets of assumptions for random matrices, namely condition (C) for nonnegative matrices and conditions (i-p) and (id) for invertible matrices. Each set of assumptions guarantees precise large deviation estimates extending the Furstenberg-Kesten theorem (Furstenberg and Kesten (1960)), i.e. the SLLN for the norm of products of random matrices. These large deviation estimates will play a prominent role in our proof below. Let $d \ge 1$. Given a probability law $\mu$ on the set of $d \times d$-matrices $M(d \times d, \mathbb{R})$, let $(M_n)_{n \in \mathbb{N}}$ be a sequence of i.i.d. random matrices with law $\mu$. Equip $\mathbb{R}^d$ with any norm $|\cdot|$, write $\|m\| := \sup_{x \in \mathbb{S}} |mx|$ for the operator norm of a matrix $m$ and denote the unit sphere in $\mathbb{R}^d$ by $\mathbb{S}$. We write

$m \cdot x := \frac{mx}{|mx|}, \quad x \in \mathbb{S},$

for the action of a matrix $m$ on $\mathbb{S}$ (as soon as this is well defined). If $\mathbb{S}$ is invariant under the action of $M$, we introduce a Markov random walk $(U_n, S_n)_{n \in \mathbb{N}}$ on $\mathbb{S} \times \mathbb{R}$ by

(2.1)    $U_n := M_n \cdots M_1 \cdot U_0 = M_n \cdot U_{n-1}, \qquad S_n := \log|M_n \cdots M_1 U_0| = \log|M_n U_{n-1}| + S_{n-1},$

for some initial data $U_0 \in \mathbb{S}$, the value of which we note by the convention $\mathbb{P}_u(U_0 = u) = 1$, $u \in \mathbb{S}$.
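The recursion for the direction $U_n$ and the log-norm $S_n$ can be made concrete with a small simulation. This is a minimal sketch under assumptions of our own: dimension $d = 2$, the Euclidean norm, and a hypothetical law $\mu$ with i.i.d. entries uniform on $(0.1, 1.5)$; the function names are illustrative and not from the paper.

```python
import math
import random

def mat_vec(m, x):
    """Matrix-vector product for a matrix given as a list of rows."""
    return [sum(m[i][j] * x[j] for j in range(len(x))) for i in range(len(m))]

def norm(x):
    """Euclidean norm (the text allows any norm; this one is chosen for the sketch)."""
    return math.sqrt(sum(c * c for c in x))

def markov_random_walk(matrices, u0):
    """Return (U_n, S_n) after applying matrices[0], then matrices[1], ... to u0."""
    u, s = list(u0), 0.0
    for m in matrices:
        y = mat_vec(m, u)
        r = norm(y)              # |M_n U_{n-1}|; must be nonzero for the action to be defined
        s += math.log(r)         # S_n = log|M_n U_{n-1}| + S_{n-1}
        u = [c / r for c in y]   # U_n = M_n . U_{n-1}
    return u, s

rng = random.Random(0)
# Hypothetical law mu: 2x2 matrices with i.i.d. entries uniform on (0.1, 1.5).
ms = [[[rng.uniform(0.1, 1.5) for _ in range(2)] for _ in range(2)] for _ in range(20)]
u0 = [1.0, 0.0]
u_n, s_n = markov_random_walk(ms, u0)

# Direct evaluation of log|M_n ... M_1 u0|, to compare with the additive form of S_n.
x = list(u0)
for m in ms:
    x = mat_vec(m, x)
direct = math.log(norm(x))
```

The comparison of `s_n` with `direct` reflects the two expressions for $S_n$: the increments $\log|M_n U_{n-1}|$ telescope to the log-norm of the full product applied to $U_0$.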

Below, the following concepts will appear several times: Write $\Gamma := [\operatorname{supp} \mu]$ for the semigroup of matrices generated by the support of $\mu$. A matrix $m$ with an algebraic simple dominant eigenvalue $\lambda_m$, that exceeds all other eigenvalues in absolute value, will be called proximal, and we will denote by $v_m^\pm \in \mathbb{S}$ the corresponding normalized eigenvectors ($v_m^+ = -v_m^-$), using the convention that $\min\{i : (v_m^+)_i > 0\} < \min\{i : (v_m^-)_i > 0\}$. Note that a matrix with all entries positive is proximal by the Perron-Frobenius theorem, and that $v_m^+$ is the Perron-Frobenius eigenvector.
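For a positive matrix, the dominant eigenvalue and the Perron-Frobenius eigenvector can be approximated by iterating the projective action $x \mapsto m \cdot x$. The following is a hedged illustration only: it uses the Euclidean norm, a fixed number of iterations, and an example matrix of our choosing; `act` and `dominant_pair` are hypothetical helper names.

```python
import math

def act(m, x):
    """Projective action m . x = m x / |m x| (Euclidean norm), returned with |m x|."""
    y = [sum(m[i][j] * x[j] for j in range(len(x))) for i in range(len(m))]
    r = math.sqrt(sum(c * c for c in y))
    return [c / r for c in y], r

def dominant_pair(m, iters=200):
    """Power iteration: approximate (lambda_m, v_m^+) for a matrix with positive entries."""
    d = len(m)
    x = [1.0 / math.sqrt(d)] * d   # start inside the positive cone
    lam = 0.0
    for _ in range(iters):
        x, lam = act(m, x)         # at a fixed point x of the action, |m x| = lambda_m
    return lam, x

lam, v = dominant_pair([[2.0, 1.0], [1.0, 1.0]])
```

For this example matrix the dominant eigenvalue is $(3 + \sqrt{5})/2$, and the limit direction has strictly positive entries, in line with the Perron-Frobenius theorem.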
2.1. Invertible Matrices: Condition (i-p). The condition (i-p) (irreducible and proximal), described below, is due to Guivarc'h, Le Page and Raugi and was studied in detail in several articles by these authors, the most comprehensive one of which is Guivarc'h and Le Page (2015).

Let now $\mu$ be a probability measure on the group $GL(d, \mathbb{R})$ of invertible $d \times d$ matrices. Then the measure $\mu$ is said to satisfy condition (i-p), if

(1) There is no finite union $W = \bigcup_{i=1}^{n} W_i$ of subspaces $0 \ne W_i \subsetneq \mathbb{R}^d$ which is $\Gamma$-invariant, i.e. $\Gamma W = W$. (strong irreducibility)

(2) $\Gamma$ contains a proximal matrix. (proximality)

It may happen that there is a $\Gamma$-invariant proper closed convex cone $C$. This situation is very similar to the case of nonnegative matrices, see Buraczewski et al. (2014). Therefore, we will exclude it and only consider matrices satisfying

(i-p,o) $\mu$ satisfies (i-p), and there is no $\Gamma$-invariant proper closed convex cone.

In this case, it can be shown that the Markov chain $(U_n)$ has a unique invariant probability measure, which is supported on

$V(\Gamma) := \{v_m^\pm \in \mathbb{S} : m \in \Gamma \text{ is proximal}\},$

and due to the strong irreducibility, the orthogonal space of $V(\Gamma)$ is $\{0\}$. Finally, write

$\iota(m) := \inf_{x \in \mathbb{S}} |mx| = \|m^{-1}\|^{-1}.$
2.2. Nonnegative Matrices: Condition (C). Next, we introduce a condition on nonnegative 
matrices, i.e. all entries greater or equal to zero, which do not need to be invertible. We will use 
similar notation as for condition (i-p), in order to highlight connections. Note that these assumptions 
can be formulated more generally for matrices leaving invariant a proper closed convex cone, see 
Buraczewski et al. (2014).


Denote the cone of vectors with nonnegative entries by $\mathbb{R}^d_\ge$ and write $\mathbb{S}_\ge = \{x \in \mathbb{R}^d_\ge : |x| = 1\}$ for its intersection with the unit sphere. It is invariant under the action of allowable matrices, i.e. matrices having nonnegative entries and no zero row nor column. For an allowable matrix, the quantity

$\iota(m) := \min_{x \in \mathbb{S}_\ge} |mx|$

is strictly positive and is the suitable substitute for $\iota$ as defined for invertible matrices.

We say that a probability measure $\mu$ on nonnegative matrices satisfies condition (C), if:

(1) Every $m \in \operatorname{supp} \mu$ is allowable.

(2) $[\operatorname{supp} \mu]$ contains a matrix all entries of which are strictly positive (a positive matrix).

Once again, this guarantees the existence of a unique invariant probability measure for $(U_n)$ on $\mathbb{S}_\ge$, which is supported in

$V(\Gamma) := \{v_m^+ : m \in \Gamma \text{ is a positive matrix}\}.$


Note that for nonnegative matrices, being positive is a stronger assumption than proximality, for it also asserts irreducibility: A diagonal matrix might be allowable and proximal as well, but in contrast to a positive matrix, its dominant eigenvector is not attractive on the whole set $\mathbb{S}_\ge$. This is why no assumption on invariant subspaces is needed here. Instead, we have to impose an additional non-lattice condition for $(S_n)$, which is automatically satisfied under (i-p): Define

$S(\Gamma) := \{\log \lambda_m : m \in \Gamma \cap \operatorname{int}(M_+)\}.$

Then we say that $\mu$ is non-arithmetic, if the (additive) subgroup of $\mathbb{R}$ generated by $S(\Gamma)$ is dense.

























2.3. Invertible Matrices: Condition (id). The third set of assumptions, called (id) for irreducible and density, appears first at the end of Kesten (1973) and was elaborated in Alsmeyer and Mentemeier (2012). In fact, it can be shown to imply condition (i-p,o). Due to the stronger assumption that $\mu$ is absolutely continuous, it often allows for simpler proofs; this is why we include it as an extra set of assumptions.

A probability measure $\mu$ on $GL(d, \mathbb{R})$ is said to satisfy condition (id) if, with $\Pi_n := M_n \cdots M_1$,

(1) for all open $B \subset \mathbb{S}$ and all $x \in \mathbb{S}$, there is $n \in \mathbb{N}$ such that $\mathbb{P}(\Pi_n \cdot x \in B) > 0$, and

(2) there are a matrix $m_0 \in GL(d, \mathbb{R})$, $\delta, c > 0$ and $n_0 \in \mathbb{N}$ such that

$\mathbb{P}(\Pi_{n_0} \in dm) \ge c\, \mathbf{1}_{B_\delta(m_0)}(m)\, \ell(dm),$

where $\ell$ denotes the Lebesgue measure on $\mathbb{R}^{d^2} \simeq M(d \times d, \mathbb{R})$.

The classical example is $\mu$ having a density about the identity matrix.

It is shown in (Alsmeyer and Mentemeier, 2012, Lemma 5.5) that $(U_n)$ is a Doeblin chain under condition (id). The support of its stationary probability measure is $\mathbb{S}$ by (Alsmeyer and Mentemeier, 2012, Proposition 4.3), therefore in the case of (id) we have $V(\Gamma) = \mathbb{S}$.


2.4. Markov random walk and change of measure. Below, we identify $S = \mathbb{S}_\ge$ in the case of nonnegative matrices and $S = \mathbb{S}$ in the case of (i-p)- or (id)-matrices. Given a measure $\mu$ on matrices as before, set

$I_\mu := \{s \ge 0 : \mathbb{E}\|M\|^s < \infty\}.$

Then, for $s \in I_\mu$, we define operators in the set $C(S)$ of continuous functions on $S$ by

(2.2)    $P^s f(x) := \mathbb{E}\big[|Mx|^s f(M \cdot x)\big].$

It was proved in Kesten (1973); Buraczewski et al. (2014) for nonnegative matrices, in Guivarc'h and Le Page (2015) for invertible matrices under condition (i-p,o) and in Mentemeier (2013) under condition (id), that the spectral radii of these operators are given by the log-convex and differentiable function

(2.3)    $k(s) := \lim_{n \to \infty} \big(\mathbb{E}\|M_n \cdots M_1\|^s\big)^{1/n}$

and that for each $s \in I_\mu$ there are

• a unique normalized function $r_s \in C(S)$;

• a unique probability measure $\nu_s \in \mathcal{P}(S)$ satisfying

(2.4)    $P^s r_s = k(s)\, r_s$ and $P^s \nu_s = k(s)\, \nu_s$.
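To get a feeling for the limit defining $k(s)$, one can evaluate $(\mathbb{E}\|M_n \cdots M_1\|^s)^{1/n}$ exactly for small $n$ when $\mu$ is uniform on a finite set of matrices. The sketch below is illustrative only: the finite support, the choice of the operator norm induced by the $\ell^1$ norm, and the names `k_estimate`, `op_norm_l1` are assumptions of this example, and the enumeration is exponential in $n$.

```python
import itertools

def mat_mul(a, b):
    """Product of two matrices given as lists of rows."""
    n, k, p = len(a), len(b), len(b[0])
    return [[sum(a[i][l] * b[l][j] for l in range(k)) for j in range(p)] for i in range(n)]

def op_norm_l1(m):
    """Operator norm induced by the l1 norm on R^d: maximal absolute column sum."""
    return max(sum(abs(row[j]) for row in m) for j in range(len(m[0])))

def k_estimate(support, s, n):
    """(E ||M_n ... M_1||^s)^(1/n) for M_i i.i.d. uniform on `support`, by full enumeration."""
    total = 0.0
    for word in itertools.product(support, repeat=n):
        prod = word[0]
        for m in word[1:]:
            prod = mat_mul(m, prod)   # left-multiply to build M_n ... M_1
        total += op_norm_l1(prod) ** s
    return (total / len(support) ** n) ** (1.0 / n)
```

Two sanity checks are available without any limit theory: in dimension one the value equals $\mathbb{E}|M|^s$ for every $n$, and for a point mass at a single matrix $m$ the value approaches the $s$-th power of the spectral radius of $m$ as $n$ grows.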


Moreover, the function $r_s$ is strictly positive and $\bar{s} := \min\{s, 1\}$-Hölder continuous. The support of the measure $\nu_s$ is given by $\operatorname{supp} \nu_s = V(\Gamma)$. Equation (2.4) yields that if $k(\gamma) = 1$, then $h(u, t) := e^{\gamma t} r_\gamma(u)$ is a harmonic function for the Markov chain $(U_n, S_n)$. Using the idea of Doob's h-transform, one can introduce new probability measures $\mathbb{P}_u^\gamma$, and it turns out that under $\mathbb{P}_u^\gamma$, $S_n$ has drift $k'(\gamma)$, i.e.

(2.5)    $\lim_{n \to \infty} \frac{S_n}{n} = \frac{k'(\gamma)}{k(\gamma)} \quad \mathbb{P}_u^\gamma\text{-a.s.}$

This idea can be extended (see (Buraczewski and Mentemeier, 2015, Section 2) for details) to yield exponentially shifted probability measures $\mathbb{P}_u^\gamma$ for all $\gamma \in I_\mu$, such that the property (2.5) holds.
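In dimension $d = 1$ the change of measure can be written down by hand, which gives a quick plausibility check of the drift formula. The sketch below rests on assumptions of our own: $M$ is a positive scalar with finitely many values, so that $k(s) = \mathbb{E} M^s$, the eigenfunction is constant, and the shifted law is the exponential tilt $\mathbb{P}^\gamma(M = v) = \mathbb{P}(M = v)\, v^\gamma / k(\gamma)$. The names `k` and `tilted_drift` and the numerical values are hypothetical.

```python
import math

def k(s, values, probs):
    """k(s) = E M^s for a finitely supported positive scalar M (case d = 1)."""
    return sum(p * v ** s for v, p in zip(values, probs))

def tilted_drift(gamma, values, probs):
    """E[log M] under the tilted law P^gamma(M = v) = p_v * v^gamma / k(gamma)."""
    kg = k(gamma, values, probs)
    return sum(p * v ** gamma / kg * math.log(v) for v, p in zip(values, probs))

values, probs = [0.5, 2.0], [0.75, 0.25]
gamma = 1.5
h = 1e-6
# Central difference approximation of k'(gamma).
numeric_derivative = (k(gamma + h, values, probs) - k(gamma - h, values, probs)) / (2 * h)
ratio = numeric_derivative / k(gamma, values, probs)
drift = tilted_drift(gamma, values, probs)
```

Here `ratio` approximates $k'(\gamma)/k(\gamma)$ and `drift` is the mean increment of $S_n$ under the tilted law; by the law of large numbers the latter is the almost sure limit of $S_n/n$ in this toy setting.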

We will make use of the following estimate: We obtain from (Buraczewski et al., 2014, Corollary 4.6) for condition (C), (Guivarc'h and Le Page, 2015, Lemma 2.8) for condition (i-p,o) (the proof working for (id) as well) that for all $s \in I_\mu$ there is a constant $c_s$, independent of $n$, such that

(2.6)    $\mathbb{E}\big[\|\Pi_n\|^s\big] \le c_s\, k(s)^n.$

2.5. Case distinction concerning N. We will make the following case distinction concerning the number $N$:

(N-random) $N \in \mathbb{N}$ is random with $1 < \mathbb{E}N < \infty$, and conditioned upon $N$, $(A_i)_{i=1}^{N}$ are i.i.d. with law $\mu$, and the variables $Q$ and $(N, (A_i)_{i \ge 1})$ are independent.

(N-fixed) $N \ge 2$ is fixed, $(A_1, \ldots, A_N, Q)$ having any dependence structure, and $\bigcup_{i=1}^{N} \operatorname{supp} A_i$ is bounded.


The case (N-fixed), without any loss of generality, can be reduced to the situation where all the random variables $A_1, \ldots, A_N$ are identically distributed. This is achieved by replacing $A_i$ with $A_{\tau(i)}$, where $\tau$ is a random permutation, independent of $(Q, (A_i)_{i \ge 1})$, distributed uniformly on the symmetry group of $\{1, \ldots, N\}$, see (Buraczewski et al., 2013, Proposition A.1) for more details. The same argument can be used to supply the even stronger property that $A_1, \ldots, A_N$ are exchangeable, i.e., for any (vector valued) function $f$ on $M(d \times d, \mathbb{R})^N$ and any permutation $\sigma$ of $\{1, \ldots, N\}$ it holds that

(2.7)    $\mathbb{E} f(A_1, \ldots, A_N) = \mathbb{E} f(A_{\sigma(1)}, \ldots, A_{\sigma(N)}).$
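The effect of the uniform random permutation can be checked exactly in a toy case. In the sketch below, which is only an illustration, the weights are a fixed tuple of three distinct scalars standing in for $(A_1, A_2, A_3)$ (an assumption of this example), and the joint law after shuffling is computed by enumerating all permutations; `law_after_shuffle` and `permute_law` are hypothetical names.

```python
import itertools
from collections import Counter
from fractions import Fraction

def law_after_shuffle(weights):
    """Exact joint law of (A_{tau(1)}, ..., A_{tau(N)}) for tau uniform on all permutations,
    with `weights` a fixed tuple standing in for (A_1, ..., A_N)."""
    perms = list(itertools.permutations(range(len(weights))))
    law = Counter()
    for tau in perms:
        law[tuple(weights[i] for i in tau)] += Fraction(1, len(perms))
    return law

def permute_law(law, sigma):
    """Law of (B_{sigma(1)}, ..., B_{sigma(N)}) when (B_1, ..., B_N) has law `law`."""
    out = Counter()
    for outcome, p in law.items():
        out[tuple(outcome[i] for i in sigma)] += p
    return out

law = law_after_shuffle((0.3, 1.7, 0.9))
```

Comparing `permute_law(law, sigma)` with `law` for every `sigma` verifies exchangeability of the shuffled tuple; in particular all coordinates then have the same marginal law.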


This property also follows immediately (for any permutation of a finite index set), if $(A_i)_{i \ge 1}$ are i.i.d. Hence for both cases, (N-random) and (N-fixed), we can now introduce the following standing assumption:

(StA)    $(A_i)_{i=1}^{N}$ are identically distributed and exchangeable.

We then set

$\mu := \mathcal{L}(A_1^\top),$

i.e. $\mu$ is the law of the transpose of $A_1$. Then the general, multivariate version of the function $m(s)$ (see Eq. (1.2)) is given by

$m(s) := (\mathbb{E}N)\, k(s),$

with $k(s)$ as defined in (2.3) (with $M_1, M_2, \ldots$ being i.i.d. random variables, having law $\mu$).


3. Statement of Results 

Here is our main result in the multidimensional situation: 


Theorem 3.1. Assumptions:

(1) Let either (N-random) or (N-fixed) be satisfied.

(2) Geometrical assumptions: Assume one of the following
(Ga) $A_i$ and $Q$ are nonnegative, $\mu$ satisfies (C) and is non-arithmetic, or
(Gb) $A_i$ are invertible and $\mu$ satisfies (i-p,o) or (id).

(3) Moment assumptions: Assume all of the following
(M1) There are $0 < \alpha < \beta$ and $\varepsilon > 0$ such that

$m(\alpha) = m(\beta) = 1, \quad \mathbb{E}|Q|^{\beta+\varepsilon} < \infty, \quad \mathbb{E}\|A_1\|^{\beta+\varepsilon} < \infty,$

(M2) there is a nondegenerate random variable $X$ satisfying (1.1) with $\mathbb{E}|X|^s < \infty$ for all $s < \beta$.

Then for this $X$ and all $u \in S$,

$\liminf_{t \to \infty} t^\beta\, \mathbb{P}(\langle u, X \rangle > t) > 0.$


As a corollary of this result we obtain that the asymptotic behavior proved in (Buraczewski et al., 2013, Theorems 2.7, 2.9 and 2.11) and (Mirek, 2013, Theorem 1.9) is exact. Tail estimates for the case of random $N$ in the multivariate situation have not yet been considered in the literature and this is the first result in that direction.

3.1. Structure of the paper. We proceed in Section 4 by introducing the weighted branching process, which allows for the study of the fixed point equation (1.1) by iteration and for the construction of random variables which satisfy the equation a.s. (in contrast to in law). Using that the support of these random variables is unbounded, we can estimate $\mathbb{P}(|X| > t)$ from below by a union of events of the type "one large term occurs"; this is made precise in Section 5, with the fundamental estimate being proved in Lemma 5.2. Section 6 is mainly combinatorial; there we count the number of events occurring in the union, and estimate from above the probability of intersections, which we make small by an appropriate choice of parameters and thereby complete the proof of the main theorem in Section 7. An outline of the proof is given in Subsection 4.3.

Remark 3.2. We have tried hard, but were not able to avoid the case distinctions concerning N. A 
natural way to do this would be the use of a spinal-tree-identity (many-to-one lemma), but it seems 
that our approach is not compatible with this technique. The main difficulty is that we consider 
sums over particular subtrees (as defined in (6.1)), which we were not able to reformulate in such a
way that a many-to-one-lemma would be applicable. 

4. Weighted branching process 

In this section, we introduce the weighted branching process, i.e. a sequence of random variables 
which satisfy Eq. (1.1) almost surely.

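4.1. Trees. Let $\mathbb{N} = \{1, 2, \ldots\}$ be the set of positive integers and let

$\mathbb{U} = \bigcup_{k=0}^{\infty} \mathbb{N}^k$

be the set of all finite sequences $\mathbf{i} = i_1 \ldots i_n$. By $\emptyset$ we denote the empty sequence. For $\mathbf{i} = i_1 \ldots i_n$ we denote by $|\mathbf{i}|$ its length and by $\mathbf{i}|_k = i_1 \ldots i_k$ the curtailment of $\mathbf{i}$ up to the first $k$ terms. Given $\mathbf{i} \in \mathbb{U}$ and $j \in \mathbb{N}$ we define $\mathbf{i}j = i_1 \ldots i_n j$, the sequence obtained by juxtaposition. In the same way we define $\mathbf{i}\mathbf{j}$ for $\mathbf{i}, \mathbf{j} \in \mathbb{U}$.

We introduce a partial ordering on $\mathbb{U}$, writing $\mathbf{i} \le \mathbf{j}$ when there exists $\mathbf{i}_1 \in \mathbb{U}$ such that $\mathbf{j} = \mathbf{i}\mathbf{i}_1$. If $\mathbf{i}, \mathbf{i}' \in \mathbb{U}$, we write $\mathbf{j} = \mathbf{i} \wedge \mathbf{i}'$ for the maximal common sequence of $\mathbf{i}$ and $\mathbf{i}'$, that is, $\mathbf{j}$ is the longest sequence such that $\mathbf{j} \le \mathbf{i}$ and $\mathbf{j} \le \mathbf{i}'$.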
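The labelling conventions above translate directly into operations on tuples. The following sketch is an illustration only: it represents a node $\mathbf{i} = i_1 \ldots i_n$ as a Python tuple and the empty sequence as `()`; the function names are ours.

```python
def curtail(i, k):
    """i|_k: the first k entries of the node label i (a tuple of positive integers)."""
    return i[:k]

def precedes(i, j):
    """Partial order i <= j: j = i i1 for some (possibly empty) sequence i1."""
    return j[:len(i)] == i

def meet(i, j):
    """i ^ j: the longest common prefix of i and j."""
    n = 0
    while n < min(len(i), len(j)) and i[n] == j[n]:
        n += 1
    return i[:n]

def juxtapose(i, j):
    """ij: concatenation of two labels."""
    return i + j
```

For example, `meet((1, 2, 3), (1, 2, 5, 1))` is `(1, 2)`, and the root `()` precedes every node.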

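We say that a subset $\mathbb{T}$ of $\mathbb{U}$ is a tree if

• $\emptyset \in \mathbb{T}$;

• if $\mathbf{i} \in \mathbb{T}$, then $\mathbf{i}|_k \in \mathbb{T}$ for any $k \le |\mathbf{i}|$;

• if $\mathbf{i} \in \mathbb{T}$ and $j \in \mathbb{N}$ then $\mathbf{i}j \in \mathbb{T}$ if and only if $1 \le j \le N_{\mathbf{i}}$, for some integer $N_{\mathbf{i}} \ge 0$.

Then $\emptyset$ is the root of the tree.

In case (N-random), let $(N_{\mathbf{i}})_{\mathbf{i} \in \mathbb{U}}$ be a family of i.i.d. copies of $N$, which thus determines the shape of the tree $\mathbb{T}$. By $\mathcal{F}_{\mathbb{T}}$ we will denote below the $\sigma$-algebra generated by $(N_{\mathbf{i}})_{\mathbf{i} \in \mathbb{U}}$. In case (N-fixed), the shape of the tree is deterministic, then $\mathbb{T} = \bigcup_{k=0}^{\infty} \{1, \ldots, N\}^k$.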

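4.2. Random variables indexed by $\mathbb{U}$. To each node $\mathbf{j} \in \mathbb{U}$ we attach an independent copy $\mathcal{A}_{\mathbf{j}} := (Q_{\mathbf{j}}, (A_{\mathbf{j}i})_{i \ge 1})$ of $\mathcal{A} := (Q, (A_i)_{i \ge 1})$ and, given a random variable $X \in \mathbb{R}^d$ satisfying (1.1), an independent copy $X_{\mathbf{j}}$ of $X$ as well. We identify $(Q_\emptyset, (A_{\emptyset i})_{i \ge 1}) = (Q, (A_i)_{i \ge 1})$. We refer to $A_{\mathbf{j}i}$ as the weight pertaining to the edge connecting $\mathbf{j}$ and $\mathbf{j}i$. Denote the total weight on the unique path connecting the node $\mathbf{j}$ with the node $\mathbf{j}\mathbf{i}$ by

$\Pi_{\mathbf{j},\mathbf{i}} := A_{\mathbf{j}\mathbf{i}|_1} A_{\mathbf{j}\mathbf{i}|_2} \cdots A_{\mathbf{j}\mathbf{i}}, \qquad \Pi_{\mathbf{j}} := \Pi_{\emptyset,\mathbf{j}},$

and define the empty product to be the $d \times d$ identity matrix. Due to the assumption $N < \infty$ $\mathbb{P}$-a.s., each generation of $\mathbb{T}$ has a.s. a finite size. Notice also that in view of (StA) the law of $\Pi_{\mathbf{j},\mathbf{i}}$ depends only on the number of factors and coincides with the law of $\Pi_{\mathbf{i}}$.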






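Recall that we defined $\mu$ to be the law of $A_1^\top$ and $M_1, M_2, \ldots$ to be a sequence of i.i.d. random variables with law $\mu$. Then $\Pi_n := M_n \cdots M_1$ has the same law as $\Pi_{\mathbf{i}}^\top$ for every $\mathbf{i} \in \mathbb{T}$ with $|\mathbf{i}| = n$ and moreover,

$\mathbb{P}(\Pi_{\mathbf{i}}^\top \cdot u \in \cdot) = \mathbb{P}_u(U_n \in \cdot), \qquad \mathbb{P}(\log|\Pi_{\mathbf{i}}^\top u| \in \cdot) = \mathbb{P}_u(S_n \in \cdot)$

(for the definition of $(U_n, S_n)_{n \ge 0}$ see (2.1)).

We write $[\mathbb{T}]_{\mathbf{j}} := \{\mathbf{i} \in \mathbb{U} : \mathbf{j}\mathbf{i} \in \mathbb{T}\}$ for the subtree of $\mathbb{T}$ rooted at $\mathbf{j}$, and define in general the shift operator acting on functions of the family $(\mathcal{A}_{\mathbf{i}}, X_{\mathbf{i}})_{\mathbf{i} \in \mathbb{U}}$ by

$\big[F\big((\mathcal{A}_{\mathbf{i}}, X_{\mathbf{i}})_{\mathbf{i} \in \mathbb{U}}\big)\big]_{\mathbf{j}} := F\big((\mathcal{A}_{\mathbf{j}\mathbf{i}}, X_{\mathbf{j}\mathbf{i}})_{\mathbf{i} \in \mathbb{U}}\big).$

With this notation, $\Pi_{\mathbf{j},\mathbf{i}} = [\Pi_{\mathbf{i}}]_{\mathbf{j}}$. The random variables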

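(4.1)    $Y_l := \sum_{|\mathbf{i}| < l} \Pi_{\mathbf{i}} Q_{\mathbf{i}} + \sum_{|\mathbf{i}| = l} \Pi_{\mathbf{i}} X_{\mathbf{i}}, \quad l \ge 1, \qquad Y_0 := X_\emptyset,$

are called the weighted branching process associated with $(Q, (A_i)_{i \ge 1})$. They satisfy

$Y_l = \sum_{i=1}^{N} A_i [Y_{l-1}]_i + Q,$

where $[Y_{l-1}]_i$ are i.i.d., with the same law as $Y_{l-1}$. Since the $X_{\mathbf{i}}$ are solutions to (1.1), in particular

$Y_l$ has the same law as $X$ for all $l \in \mathbb{N}$.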
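The almost sure recursion for $Y_l$ holds for any values attached to the nodes, which makes it easy to check on a computer. The sketch below is illustrative and makes several simplifying assumptions of its own: dimension $d = 1$, a fixed offspring number $N = 2$, and hypothetical laws for the weights, shifts and terminal values (the terminal values are not actual fixed points; only the algebraic identity is tested).

```python
import itertools
import random

N = 2  # fixed offspring number (case (N-fixed)); a choice made for this sketch
rng = random.Random(1)
_weights, _shifts, _leaves = {}, {}, {}

def _lazy(store, node, draw):
    """Draw a value for `node` once and reuse it afterwards."""
    if node not in store:
        store[node] = draw()
    return store[node]

def A(node):
    """Scalar weight on the edge leading into `node` (hypothetical uniform law)."""
    return _lazy(_weights, node, lambda: rng.uniform(0.2, 0.9))

def Q(node):
    """Shift term attached to `node` (hypothetical uniform law)."""
    return _lazy(_shifts, node, lambda: rng.uniform(-1.0, 1.0))

def X(node):
    """Terminal value attached to `node` (placeholder, standard normal)."""
    return _lazy(_leaves, node, lambda: rng.gauss(0.0, 1.0))

def path_weight(root, node):
    """Pi_{root,node}: product of the edge weights on the path from root to root+node."""
    w = 1.0
    for k in range(1, len(node) + 1):
        w *= A(root + node[:k])
    return w

def Y(l, root=()):
    """[Y_l]_root = sum_{|i|<l} Pi_i Q_i + sum_{|i|=l} Pi_i X_i inside the subtree at root."""
    total = 0.0
    for depth in range(l + 1):
        for node in itertools.product(range(1, N + 1), repeat=depth):
            term = X(root + node) if depth == l else Q(root + node)
            total += path_weight(root, node) * term
    return total

lhs = Y(3)
rhs = sum(A((i,)) * Y(2, root=(i,)) for i in range(1, N + 1)) + Q(())
```

Here `lhs` and `rhs` are the two sides of $Y_l = \sum_{i=1}^{N} A_i [Y_{l-1}]_i + Q$ for $l = 3$; they agree up to floating point error because both are the same finite sum, grouped differently.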

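We define moreover

(4.2)    $Z_{l,\mathbf{i}k} := \sum_{j \ne k,\ j \le N_{\mathbf{i}}} A_{\mathbf{i}j} [Y_{l-|\mathbf{i}|-1}]_{\mathbf{i}j} + Q_{\mathbf{i}}, \quad l > |\mathbf{i}|.$

Then for $l = |\mathbf{i}| + 1$ we have

(4.3)    $Z_{l,\mathbf{i}k} = \sum_{j \ne k,\ j \le N_{\mathbf{i}}} A_{\mathbf{i}j} X_{\mathbf{i}j} + Q_{\mathbf{i}}.$

By (StA) the random variables $(Z_{l,\mathbf{i}k})_{1 \le k \le N_{\mathbf{i}}}$ are obviously identically distributed. To simplify our notation we define

$Z_{\mathbf{i}k} := Z_{|\mathbf{i}|+1,\mathbf{i}k}.$

For every $l \in \mathbb{N}$ and $\mathbf{i} \in \mathbb{T}$ with $|\mathbf{i}| \le l$, we can rearrange the sum in Eq. (4.1) to obtain the a.s. identity

(4.4)    $Y_l = \Pi_{\mathbf{i}} [Y_{l-|\mathbf{i}|}]_{\mathbf{i}} + \sum_{k < |\mathbf{i}|} \Pi_{\mathbf{i}|_k} Z_{l,\mathbf{i}|_{k+1}}.$

Observe that this implies for $|\mathbf{i}| \le l$ the following identity in law:

(4.5)    $Y_l$ has the same law as $\Pi_{\mathbf{i}} X_{\mathbf{i}} + \sum_{k < |\mathbf{i}|} \Pi_{\mathbf{i}|_k} Z_{\mathbf{i}|_{k+1}}.$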
4.3. Outline of the proof. This identity may give a first idea of how we are going to proceed in the proof of the main theorem: We consider sets where $\Pi_{\mathbf{i}}$ is large, while the remaining sum is small. Therefore, we in turn study sets where $\|\Pi_{\mathbf{i}}\|$ is large, but smaller products are comparably small (with the comparison governed by a parameter $C_0$). The probability of such sets will be estimated using large deviation results for products of random matrices, obtained in Buraczewski and Mentemeier (2015). Then the probability that $X$ is large will be estimated from below by the union of sets as described above, over different $\mathbf{i}$. It will be convenient to not take the union over $\mathbf{i}$ from the whole tree, but rather from a sparse subtree, in order to make the events sufficiently disjoint. The relative size of the subtree will be given by a parameter $C_1$, which will be a free parameter of the proof.

A particular problem in the multivariate situation is to compare $\Pi_{\mathbf{i}} X_{\mathbf{i}}$ with $\|\Pi_{\mathbf{i}}\|$. We deal with this question at the beginning of the next section, the better part of which is devoted to formulating precisely the heuristics we described above.


5. First estimates 

We start this section by a lemma stating that $X$ has unbounded support in "all" directions of $\mathbb{R}^d$ resp. $\mathbb{R}^d_\ge$, which we will make use of subsequently in Lemma 5.2, which gives the fundamental comparison between $\mathbb{P}(|X| > t)$ and the union of large deviation events.


Lemma 5.1. Assume that the hypotheses of Theorem 3.1 are satisfied and that $X$ is not a.s. constant. Then for all $D > 0$ there is $J < \infty$ and $\varepsilon_j > 0$, $1 \le j \le J$, and a $\kappa > 0$, such that there are disjoint subsets $\Omega_j$ of $\mathbb{R}^d$ with

$\mathbb{P}\big(X \in \Omega_j \text{ and } |X| > D/\varepsilon_j\big) \ge \kappa$

and moreover

(5.1)    $\mathbb{R}^d \subset \bigcup_{j=1}^{J} \Omega_j^*,$

where $\Omega_j^*$ are the cones

$\Omega_j^* := \{z \in \mathbb{R}^d : \langle z, x \rangle \ge \varepsilon_j |z| |x| \text{ for all } x \in \Omega_j\}.$

If $\mu$ satisfies (C), then the same statement is valid, but with $\Omega_j$ being subsets of $\mathbb{R}^d_\ge$ and (5.1) replaced by

$\mathbb{R}^d_\ge \subset \bigcup_{j=1}^{J} \Omega_j^*.$

Proof. Let $X_1, \ldots, X_N$ be i.i.d. copies of $X$ (with $N$ constant or random). Set $B := \sum_{i=2}^{N} A_i X_i + Q$. Since the $X_i$ are i.i.d., nontrivial and independent of $(A_i)_{i \ge 1}$ and $Q$, it follows that also $B$ must be nontrivial. Moreover, due to the moment assumptions (M1) and (M2) and the convexity of $k(s)$, there is $s \in (\alpha, \beta)$ s.t. $k'(s) > 0$ and $B$ has a finite moment of order $s$. Then $X$ satisfies the equation $X \stackrel{\text{law}}{=} A_1 X_1 + B$, and for $X$ satisfying such an equation, the results are shown in (Buraczewski and Mentemeier, 2015, Lemma 10.2). Note that there only the condition $k'(s) > 0$ is relevant (which follows from the convexity of $k$); the additional condition (stated there) that $k(s) = 1$ is not needed.

□
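Now we turn to the announced estimate from below for $\mathbb{P}(\langle u, X \rangle > t)$. Our estimates will be given in terms of the sets

$V_{\mathbf{i},t} := \big\{ |\Pi_{\mathbf{i}}^\top u| > t \text{ and } \|\Pi_{\mathbf{i}|_k}^\top\| \big(|Z_{\mathbf{i}|_{k+1}}| \vee 1\big) \le e^{-(|\mathbf{i}|-k)\delta} C_0 t \ \ \forall k < |\mathbf{i}| \big\},$

$W_{\mathbf{i},\mathbf{i}',t} := \big\{ |\Pi_{\mathbf{i}}^\top u| > t,\ |\Pi_{\mathbf{i}'}^\top u| > t,\ \|\Pi_{\mathbf{i} \wedge \mathbf{i}'}^\top\| \le e^{-(|\mathbf{i}'| - |\mathbf{i} \wedge \mathbf{i}'|)\delta} C_0 t \big\}$

for some constants $C_0$ and $\delta$ that will be defined below.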


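Lemma 5.2. For all $u \in S$, $C_0 > 0$ there is $\kappa > 0$ such that for all $t > 0$ and all a.s. subsets $W \subset \mathbb{T}$,

(5.2)    $\mathbb{P}(\langle u, X \rangle > t) \ge \kappa\, \mathbb{E}\Big[\sum_{\mathbf{i} \in W} \mathbb{P}(V_{\mathbf{i},t})\Big] - \mathbb{E}\Big[\sum_{\mathbf{i}, \mathbf{i}' \in W,\ \mathbf{i} \ne \mathbf{i}'} \mathbb{P}(W_{\mathbf{i},\mathbf{i}',t})\Big] \in [-\infty, \infty).$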


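Proof. Fix $l \in \mathbb{N}$. From Eq. (4.4) we obtain that

(5.3)    $|\langle u, Y_l \rangle| \ge \big|\big\langle \Pi_{\mathbf{i}}^\top u, [Y_{l-|\mathbf{i}|}]_{\mathbf{i}} \big\rangle\big| - \sum_{k < |\mathbf{i}|} \|\Pi_{\mathbf{i}|_k}^\top\|\, |Z_{l,\mathbf{i}|_{k+1}}|.$

For arbitrary $C_0, \delta, t > 0$ let $D = 1 + C_0/(1 - e^{-\delta})$ and introduce the family of sets

$\widetilde{V}_{\mathbf{i},t} := \bigcup_{j=1}^{J} \Big( V_{\mathbf{i},t} \cap \big\{X_{\mathbf{i}} \in \Omega_j \text{ and } |X_{\mathbf{i}}| > D/\varepsilon_j\big\} \cap \big\{\Pi_{\mathbf{i}}^\top u \in \Omega_j^*\big\} \Big),$

where the cones $\Omega_j$ and $\Omega_j^*$ were defined in Lemma 5.1, as well as the sets

$\widetilde{V}^{(l)}_{\mathbf{i},t} := \bigcup_{j=1}^{J} \Big( \big\{ |\Pi_{\mathbf{i}}^\top u| > t \text{ and } \|\Pi_{\mathbf{i}|_k}^\top\| \big(|Z_{l,\mathbf{i}|_{k+1}}| \vee 1\big) \le e^{-(|\mathbf{i}|-k)\delta} C_0 t \ \ \forall k < |\mathbf{i}| \big\} \cap \big\{[Y_{l-|\mathbf{i}|}]_{\mathbf{i}} \in \Omega_j \text{ and } |[Y_{l-|\mathbf{i}|}]_{\mathbf{i}}| > D/\varepsilon_j\big\} \cap \big\{\Pi_{\mathbf{i}}^\top u \in \Omega_j^*\big\} \Big).$

The latter sets are defined in such a way that on $\widetilde{V}^{(l)}_{\mathbf{i},t}$,

$|\langle u, Y_l \rangle| \ge \varepsilon_j\, |\Pi_{\mathbf{i}}^\top u|\, |[Y_{l-|\mathbf{i}|}]_{\mathbf{i}}| - \sum_{k < |\mathbf{i}|} \|\Pi_{\mathbf{i}|_k}^\top\| \big(|Z_{l,\mathbf{i}|_{k+1}}| \vee 1\big) \ge Dt - \sum_{k < |\mathbf{i}|} e^{-(|\mathbf{i}|-k)\delta} C_0 t,$

which is larger than $t$ upon choosing $D$ large enough. Here again, we need the a.s. version (4.4).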
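To spell out why the choice $D = 1 + C_0/(1 - e^{-\delta})$ is large enough, here is a short verification. It is a sketch that uses only the bound $e^{-(|\mathbf{i}|-k)\delta} C_0 t$ on each summand and a geometric series; it is our own elaboration of the step rather than a quotation.

```latex
\sum_{k<|\mathbf{i}|} e^{-(|\mathbf{i}|-k)\delta}\, C_0 t
  = C_0 t \sum_{j=1}^{|\mathbf{i}|} e^{-j\delta}
  \le \frac{C_0 t\, e^{-\delta}}{1-e^{-\delta}}
  < \frac{C_0 t}{1-e^{-\delta}},
\qquad\text{hence}\qquad
D t - \sum_{k<|\mathbf{i}|} e^{-(|\mathbf{i}|-k)\delta}\, C_0 t
  > \Bigl(D - \frac{C_0}{1-e^{-\delta}}\Bigr) t = t .
```

The bound is uniform in the depth $|\mathbf{i}|$, which is what allows a single constant $D$ to serve all nodes.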
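Since $X \stackrel{\text{law}}{=} Y_l$ for any $l \in \mathbb{N}$, we obtain the following estimate

$\mathbb{P}(|\langle u, X \rangle| > t) \ge \mathbb{P}\Big( \bigcup_{|\mathbf{i}| \le l,\ \mathbf{i} \in W} \widetilde{V}^{(l)}_{\mathbf{i},t} \Big),$

which is valid for any a.s. subset $W \subset \mathbb{T}$. Below, we use the shorthand $W_l = \{\mathbf{i} \in \mathbb{T} : |\mathbf{i}| \le l,\ \mathbf{i} \in W\}$.

Assuming that $\sup_{l \in \mathbb{N}} \mathbb{E} \sum_{\mathbf{i} \in W_l} \mathbf{1}_{\widetilde{V}^{(l)}_{\mathbf{i},t}} < \infty$ (this will be shown below), we use the inclusion-exclusion formula and separate as follows:

$\mathbb{P}\Big( \bigcup_{\mathbf{i} \in W_l} \widetilde{V}^{(l)}_{\mathbf{i},t} \Big) \ge \mathbb{E}\Big( \sum_{\mathbf{i} \in W_l} \mathbf{1}_{\widetilde{V}^{(l)}_{\mathbf{i},t}} \Big) - \mathbb{E}\Big( \sum_{\mathbf{i}, \mathbf{i}' \in W_l,\ \mathbf{i} \ne \mathbf{i}'} \mathbf{1}_{\widetilde{V}^{(l)}_{\mathbf{i},t} \cap \widetilde{V}^{(l)}_{\mathbf{i}',t}} \Big).$

Now observe that $\widetilde{V}^{(l)}_{\mathbf{i},t} \cap \widetilde{V}^{(l)}_{\mathbf{i}',t} \subset W_{\mathbf{i},\mathbf{i}',t}$, which gives the estimate

$\mathbb{P}\Big( \bigcup_{\mathbf{i} \in W_l} \widetilde{V}^{(l)}_{\mathbf{i},t} \Big) \ge \mathbb{E}\Big( \sum_{\mathbf{i} \in W_l} \mathbf{1}_{\widetilde{V}^{(l)}_{\mathbf{i},t}} \Big) - \mathbb{E}\Big( \sum_{\mathbf{i}, \mathbf{i}' \in W_l,\ \mathbf{i} \ne \mathbf{i}'} \mathbf{1}_{W_{\mathbf{i},\mathbf{i}',t}} \Big).$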

As for obtaining Eq. (4.5) from Eq. (4.4), we have that

$\mathbb{P}$-a.s.

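The union in the definition of $\widetilde{V}_{\mathbf{i},t}$ is over disjoint sets, since the $\Omega_j$ are disjoint. Furthermore, $X_{\mathbf{i}}$ is independent of $V_{\mathbf{i},t}$ and $\Pi_{\mathbf{i}}$, and has the same law as $X$, hence

(5.4)    $\mathbb{P}(\widetilde{V}_{\mathbf{i},t}) = \sum_{j=1}^{J} \mathbb{P}\big(V_{\mathbf{i},t} \cap \{\Pi_{\mathbf{i}}^\top u \in \Omega_j^*\}\big)\, \mathbb{P}\big(X_{\mathbf{i}} \in \Omega_j \text{ and } |X_{\mathbf{i}}| > D/\varepsilon_j\big) = \sum_{j=1}^{J} \mathbb{P}\big(X \in \Omega_j \text{ and } |X| > D/\varepsilon_j\big)\, \mathbb{P}\big(V_{\mathbf{i},t} \cap \{\Pi_{\mathbf{i}}^\top u \in \Omega_j^*\}\big) \ge \kappa\, \mathbb{P}(V_{\mathbf{i},t}).$

Thus, we have obtained the following estimate, valid for all $l \in \mathbb{N}$:

(5.5)    $\mathbb{P}(|\langle u, X \rangle| > t) \ge \kappa\, \mathbb{E}\Big( \sum_{\mathbf{i} \in W_l} \mathbb{P}(V_{\mathbf{i},t}) \Big) - \mathbb{E}\Big( \sum_{\mathbf{i}, \mathbf{i}' \in W_l,\ \mathbf{i} \ne \mathbf{i}'} \mathbf{1}_{W_{\mathbf{i},\mathbf{i}',t}} \Big).$

We finally have to justify that $\sup_{l \in \mathbb{N}} \mathbb{E} \sum_{\mathbf{i} \in W_l} \mathbf{1}_{\widetilde{V}^{(l)}_{\mathbf{i},t}} < \infty$. But estimating similarly as in (5.4), we obtain that $\mathbb{P}(\widetilde{V}^{(l)}_{\mathbf{i},t}) \le J\, \mathbb{P}(V_{\mathbf{i},t})$, and the supremum is obviously bounded by

$J\, \mathbb{E} \sum_{\mathbf{i} \in \mathbb{T}} \mathbb{P}(V_{\mathbf{i},t}) \le J\, \mathbb{E} \sum_{\mathbf{i} \in \mathbb{T}} \mathbb{P}(|\Pi_{\mathbf{i}}^\top u| > t) = J \sum_{n=0}^{\infty} (\mathbb{E}N)^n\, \mathbb{P}_u(S_n > \log t) \le c_s J \sum_{n=0}^{\infty} (\mathbb{E}N)^n k(s)^n t^{-s} < \infty,$

where we used the Markov inequality for some $s \in (\alpha, \beta)$ and the estimate (2.6) in the last step. □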
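The finiteness of the last sum can be checked directly. The computation below is a sketch of that step, assuming (as the setting with $m(\alpha) = m(\beta) = 1$ and log-convex, non-constant $m$ suggests) that $m(s) < 1$ for the chosen $s \in (\alpha, \beta)$, and writing $\Pi_n := M_n \cdots M_1$.

```latex
\sum_{n=0}^{\infty} (\mathbb{E}N)^n\, \mathbb{P}_u(S_n > \log t)
  \le \sum_{n=0}^{\infty} (\mathbb{E}N)^n\, t^{-s}\, \mathbb{E}\|\Pi_n\|^s
  \le c_s\, t^{-s} \sum_{n=0}^{\infty} \bigl((\mathbb{E}N)\, k(s)\bigr)^n
  = \frac{c_s\, t^{-s}}{1 - m(s)} < \infty .
```

The first inequality is Markov's inequality applied to $|\Pi_n u|^s \le \|\Pi_n\|^s$, the second is the estimate (2.6), and the last identity uses $m(s) = (\mathbb{E}N)\, k(s)$.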

5.1. Estimates for $\mathbb{P}(V_{\mathbf{i},t})$, $\mathbb{P}(W_{\mathbf{i},\mathbf{i}',t})$. The further analysis of Eq. (5.2) splits in two parts. On the one hand, we have to estimate the probabilities appearing there, in terms of the distance between $\mathbf{i}$ and $\mathbf{i}'$, and on the other hand, we have to do some combinatorics on the tree, in order to do the summation. In this section, we bound the probabilities. The set $W$ will be defined precisely in the next section. However, it will be a subset of the tree $\mathbb{T}$ consisting of vertices $\mathbf{i}$ such that

$n_t - \sqrt{n_t} \le |\mathbf{i}| \le n_t - \sqrt{n_t}/2,$

where $n_t = \lfloor \log t / \rho \rfloor$ and $\rho = m'(\beta)$. Thus the estimates provided below will be only for this particular set of indices.

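5.2. Probability of $V_{\mathbf{i},t}$. In view of (StA), the probability of $V_{\mathbf{i},t}$ does in fact only depend on $n = |\mathbf{i}|$. Let $(M_n, Z_n)$ be a sequence of i.i.d. copies of $(A_1^\top, Z)$ for $Z = \sum_{i=2}^{N} A_i X_i + Q$. Define, with $\Pi_n := M_n \cdots M_1$,

$V_{n,t} := \big\{ |\Pi_n u| > t \text{ and } \|\Pi_k\| \big(|Z_{k+1}| \vee 1\big) \le e^{-(n-k)\delta} C_0 t \ \ \forall k < n \big\}.$

Then $\mathbb{P}(V_{\mathbf{i},t}) = \mathbb{P}(V_{n,t})$ as soon as $|\mathbf{i}| = n$.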

The sets $V_{n,t}$ were already considered for $d = 1$ in Buraczewski et al. (2015) and for $d \ge 2$ (under the same hypotheses as in the present paper) in Buraczewski and Mentemeier (2015) (Lemma 10.5). We refer to these two papers for the proofs of the following lemma.

Lemma 5.3. Assume that $\mathbb{E}|Z|^\gamma < \infty$ for some $\gamma > 0$, then there are constants $\delta, C_0, D_1, D_2, N_0 > 0$ such that

$D_1 \frac{k(\beta)^n}{\sqrt{n}\, t^\beta} \le \mathbb{P}(|\Pi_n u| > t) \le D_2 \frac{k(\beta)^n}{\sqrt{n}\, t^\beta},$

$D_1 \frac{k(\beta)^n}{\sqrt{n}\, t^\beta} \le \mathbb{P}(V_{n,t}) \le D_2 \frac{k(\beta)^n}{\sqrt{n}\, t^\beta}$

for all $\lfloor \log t / \rho \rfloor = n_t > N_0$ and every $n_t - \sqrt{n_t} \le n \le n_t - \sqrt{n_t}/2$.

For the assertion of this lemma to hold, $k(\beta) = 1$ is not necessary, we only need that $k'(\beta) > 0$.
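A heuristic first-moment count (not part of the proof) indicates why this window of generations is the natural one. It assumes that the large deviation estimate has the form $k(\beta)^n / (\sqrt{n}\, t^\beta)$, which is our reading of Lemma 5.3, and uses $(\mathbb{E}N)\, k(\beta) = 1$: the expected number of nodes in generation $n$ is $(\mathbb{E}N)^n$, so generation $n$ contributes about $1/(\sqrt{n}\, t^\beta)$, and summing $1/\sqrt{n}$ over roughly $\sqrt{n_t}/2$ generations gives a factor of order one. The function names below are hypothetical, and $\log t$ is passed directly to avoid overflow.

```python
import math

def generation_window(log_t, rho):
    """Generations n with n_t - sqrt(n_t) <= n <= n_t - sqrt(n_t)/2, n_t = floor(log t / rho)."""
    n_t = math.floor(log_t / rho)
    root = math.sqrt(n_t)
    lo, hi = math.ceil(n_t - root), math.floor(n_t - root / 2)
    return n_t, range(lo, hi + 1)

def first_moment_factor(log_t, rho):
    """Sum of 1/sqrt(n) over the window: the factor multiplying t^(-beta) in the heuristic
    count, where each generation contributes (EN)^n k(beta)^n / (sqrt(n) t^beta)."""
    _, window = generation_window(log_t, rho)
    return sum(1.0 / math.sqrt(n) for n in window)

n_t, window = generation_window(500000.0, 0.5)
factor = first_moment_factor(500000.0, 0.5)
```

With these inputs $n_t = 10^6$, the window contains 501 generations, and the factor is close to $1/2$, consistent with a total of order $t^{-\beta}$.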


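5.3. Probability of $W_{\mathbf{i},\mathbf{i}',t}$. The main result of this subsection is the following lemma.

Lemma 5.4. Assume that the hypotheses of Theorem 3.1 are satisfied. Then there is $\chi > 0$ such that for all $t > e^{\rho N_0}$ and all $|\mathbf{i}| \ge |\mathbf{i}'|$ with $n_t - \sqrt{n_t} \le |\mathbf{i}|, |\mathbf{i}'| \le n_t - \sqrt{n_t}/2$ it holds that

(5.6)    $\mathbb{P}(W_{\mathbf{i},\mathbf{i}',t}) \le C_2\, \frac{k(\beta)^{|\mathbf{i}|}}{\sqrt{|\mathbf{i}|}\, t^\beta} \cdot \frac{k(\beta)^{|\mathbf{i}'| - |\mathbf{i} \wedge \mathbf{i}'|}}{e^{\chi(|\mathbf{i}'| - |\mathbf{i} \wedge \mathbf{i}'|)}}$

with a constant $C_2$ which is independent of $t$.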


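Proof. We will start with some general calculations, and then deal with the cases (N-random) and (N-fixed) separately. Denote the joint law of $(A_1^\top, A_2^\top)$ by $\eta$. Recall that for each $\mathbf{i} \in \mathbb{T}$, $1 \le k < l \le N_{\mathbf{i}}$, $\mathcal{L}((A_{\mathbf{i}k}^\top, A_{\mathbf{i}l}^\top)) = \eta$ as well. For $\mathbf{i}, \mathbf{i}' \in \mathbb{T}$, we write

$\mathbf{i}_0 = \mathbf{i} \wedge \mathbf{i}', \quad m = |\mathbf{i} \wedge \mathbf{i}'| = |\mathbf{i}_0|, \quad \mathbf{i}|_{m+1} = \mathbf{i}_0 k, \quad \mathbf{i}'|_{m+1} = \mathbf{i}_0 l, \quad \ell = |\mathbf{i}|, \quad p = |\mathbf{i}'|, \quad U_{\mathbf{i}} := \Pi_{\mathbf{i}}^\top u / |\Pi_{\mathbf{i}}^\top u|,$

so that $\mathbf{i} = \mathbf{i}_0 k \mathbf{j}$ and $\mathbf{i}' = \mathbf{i}_0 l \mathbf{j}'$ for suitable $\mathbf{j}, \mathbf{j}' \in \mathbb{U}$, and recall the notation $\Pi_{\mathbf{i},\mathbf{j}} = A_{\mathbf{i}\mathbf{j}|_1} \cdots A_{\mathbf{i}\mathbf{j}}$, $\Pi_{\mathbf{i},\mathbf{j}}^\top = A_{\mathbf{i}\mathbf{j}}^\top \cdots A_{\mathbf{i}\mathbf{j}|_1}^\top$ for the product of the weights along the path between $\mathbf{i}$ and $\mathbf{i}\mathbf{j}$. Then

$\mathbb{P}(W_{\mathbf{i},\mathbf{i}',t}) = \mathbb{P}\big( |\Pi_{\mathbf{i}}^\top u| > t,\ |\Pi_{\mathbf{i}'}^\top u| > t,\ \|\Pi_{\mathbf{i}_0}^\top\| \le e^{-(p-m)\delta} C_0 t \big)$

$\le \mathbb{P}\big( |\Pi_{\mathbf{i}_0 k,\mathbf{j}}^\top (A_{\mathbf{i}_0 k}^\top \cdot U_{\mathbf{i}_0})|\, |A_{\mathbf{i}_0 k}^\top U_{\mathbf{i}_0}|\, |\Pi_{\mathbf{i}_0}^\top u| > t,\ \|\Pi_{\mathbf{i}_0 l,\mathbf{j}'}^\top\|\, \|A_{\mathbf{i}_0 l}^\top\|\, \|\Pi_{\mathbf{i}_0}^\top\| > t,\ \|\Pi_{\mathbf{i}_0}^\top\| \le e^{-(p-m)\delta} C_0 t \big)$

$\le \mathbb{P}\big( |\Pi_{\mathbf{i}_0 k,\mathbf{j}}^\top (A_{\mathbf{i}_0 k}^\top \cdot U_{\mathbf{i}_0})|\, |A_{\mathbf{i}_0 k}^\top U_{\mathbf{i}_0}|\, |\Pi_{\mathbf{i}_0}^\top u| > t,\ \|\Pi_{\mathbf{i}_0 l,\mathbf{j}'}^\top\|\, \|A_{\mathbf{i}_0 l}^\top\| \ge e^{\delta(p-m)}/C_0 \big)$

$= \int \mathbb{P}\big( |\Pi_{\mathbf{i}_0 k,\mathbf{j}}^\top (a_1 \cdot U_{\mathbf{i}_0})|\, |a_1 U_{\mathbf{i}_0}|\, |\Pi_{\mathbf{i}_0}^\top u| > t \big)\, \mathbb{P}\big( \|\Pi_{\mathbf{i}_0 l,\mathbf{j}'}^\top\|\, \|a_2\| \ge e^{\delta(p-m)}/C_0 \big)\, \eta(da_1, da_2)$

(5.7)    $\le \int \mathbb{P}\big( |\Pi_{\mathbf{i}_0 k,\mathbf{j}}^\top (a_1 \cdot U_{\mathbf{i}_0})|\, |a_1 U_{\mathbf{i}_0}|\, |\Pi_{\mathbf{i}_0}^\top u| > t \big)\, \frac{\mathbb{E}\big[\|\Pi_{p-m-1}\|^\alpha\big]\, \|a_2\|^\alpha\, C_0^\alpha}{e^{\alpha\delta(p-m)}}\, \eta(da_1, da_2),$

where we conditioned upon $(A_{\mathbf{i}_0 k}, A_{\mathbf{i}_0 l})$ and used the Markov inequality in the last step. The reason that we applied it with the exponent $\alpha$ is that we will be able to replace $\mathbb{E}\|\Pi_n\|^\alpha$ by $k(\alpha)^n$, but the latter one equals also $k(\beta)^n$.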

^From now on, we will consider the cases (N-random) and (N-fixed) separately. 


Case (N-random) 
In this case, $a_1$ and $a_2$ are independent, and Eq. (5.7) simplifies to


^OL5{p—m) 


< 


D',k{p)p cs 


ga5(p-m) 


by Lemma 5.3 and Eq. (2.6) (recall $k(\alpha) = (\mathbb{E}N)^{-1} = k(\beta)$), with $C_2 :=$ and $\chi := \alpha\delta$.


Case (N-fixed). 

In this case, $\|a_2\|$ is bounded by some constant $c_A$, say, and Eq. (5.7) simplifies to


|n,„iir]Co“cS 


E[||n,_^_iincgc^ 

pa6{p—m) 


ri{dai,da2) 


= p(|n;it| >t) 


gQ;(5(p—m) 


and from here, we can proceed as before to obtain the estimate (5.6) with $C_2 := D_2 c_A^\alpha C_0^\alpha c_\delta / k(\beta)$
and $\chi := \alpha\delta$. □


6. Combinatorics on the tree 


In this section we consider $N$ to be random. If $N$ were fixed, the calculations would be similar
but easier. As mentioned above, the subset $W$ of $T$ will contain only some of the nodes satisfying
$n_t - \sqrt{n_t} \le |i| \le n_t - \sqrt{n_t}/2$ and therefore will also depend on $t$. We will consider only a sparse
subset of those nodes, namely only nodes from every $C_1$-th generation which moreover end with $C_1$
ones. The number $C_1 \in \mathbb{N}$ will be a parameter of the proof, to be fixed at the very end. Its choice
will be independent of $t$. Note, however, that the estimate $\mathbb{P}(\langle u, X \rangle > t) \ge \epsilon t^{-\beta}$ is only valid for large
enough $t$, namely $t > C_1 e^{N_0 \rho}$. Since it is sufficient to show that $\liminf_{t \to \infty} t^\beta\, \mathbb{P}(\langle u, X \rangle > t) \ge \epsilon > 0$,
we will even restrict to such $t$ for which $\sqrt{n_t}/2C_1$ is an integer. This is not really necessary, but
simplifies expressions.

Below, we will often use that $1 = m(\beta) = k(\beta)\,\mathbb{E}N$, and therefore $k(\beta) = (\mathbb{E}N)^{-1}$. Keep in mind
that under (N-random), the shape of the tree is random, as will be that of its subset. Therefore, we
have to take expectations with respect to the shape of the tree.


To be precise, upon fixing $t$, let $n_t = [\log t/\rho]$. We will consider nodes the generations of which
are from the set

$$L_t = \{k \in C_1\mathbb{N} : n_t - \sqrt{n_t} \le k \le n_t - \sqrt{n_t}/2\}.$$

Note $\# L_t = \sqrt{n_t}/2C_1$, since we assume the latter to be an integer. Denote by $1_{C_1} = 1 \ldots 1 \in U$ the sequence
consisting of $C_1$ ones. Define

$$W = \{i \in T : |i| \in L_t \text{ and } i = i|_{|i| - C_1} 1_{C_1}\}. \tag{6.1}$$

We will calculate below several times the expected number of elements of $W$ lying on the level
$k \in L_t$:

$$\mathbb{E}[\#\{i \in W : |i| = k\}] = \mathbb{E}[\#\{i \in T : |i| = k - C_1\}] = (\mathbb{E}N)^{k - C_1} = k(\beta)^{C_1 - k}. \tag{6.2}$$
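The identity in Eq. (6.2) can be checked with exact rational arithmetic. The following is a minimal sketch, not part of the original argument: the function name `expected_level_size` and the numerical values of $\mathbb{E}N$, $C_1$ and $k$ are hypothetical choices made only for illustration. It uses the standard fact that the expected generation size of a branching tree is multiplied by $\mathbb{E}N$ in each generation.

```python
from fractions import Fraction


def expected_level_size(mean_offspring, k, c1):
    """Expected number of nodes of W on level k, following Eq. (6.2).

    A node of W on level k is determined by its ancestor in generation
    k - c1, so we only need the expected size of that generation.
    """
    if k < c1:
        raise ValueError("level k must be at least c1")
    size = Fraction(1)            # generation 0 consists of the root only
    for _ in range(k - c1):       # each generation multiplies the mean size by EN
        size *= mean_offspring
    return size


# Hypothetical values, chosen only for illustration.
mean_n = Fraction(3, 2)           # EN
k_beta = 1 / mean_n               # k(beta) = (EN)^{-1}
c1, k = 4, 12                     # C_1 and a level k that is a multiple of C_1

lhs = expected_level_size(mean_n, k, c1)   # (EN)^{k - C_1}
rhs = k_beta ** (c1 - k)                   # k(beta)^{C_1 - k}
```

Both `lhs` and `rhs` are exact fractions, so they can be compared with `==` without rounding concerns.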


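Lemma 6.1. For all $t$ large enough (and such that $\sqrt{n_t}/2C_1 \in \mathbb{N}$),

$$\mathbb{E}\Big[\sum_{i \in W} \mathbb{P}(V_{i,t})\Big] \ge \frac{D_1 k(\beta)^{C_1}}{2C_1} \cdot \frac{1}{t^\beta}.$$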














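Proof. Using Lemma 5.3 and Eq. (6.2),

$$\mathbb{E}\Big[\sum_{i \in W} \mathbb{P}(V_{i,t})\Big] \ge \frac{D_1}{t^\beta \sqrt{n_t}}\, \mathbb{E}\Big[\sum_{i \in W} k(\beta)^{|i|}\Big] = \frac{D_1}{t^\beta \sqrt{n_t}} \sum_{k \in L_t} k(\beta)^{k}\, k(\beta)^{C_1 - k} = \frac{D_1 k(\beta)^{C_1}}{2C_1} \cdot \frac{1}{t^\beta}.$$

□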
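The bound of Lemma 6.1 rests on a cancellation: on level $k$ the expected number of nodes of $W$ is $k(\beta)^{C_1 - k}$ by Eq. (6.2), while each node carries a factor $k(\beta)^{k}$, so every level contributes $k(\beta)^{C_1}$ regardless of $k$. The sketch below illustrates this with exact fractions. It assumes that the lower bound for each node is proportional to $k(\beta)^{|i|}$; the function name `level_sum` and all numerical values are hypothetical.

```python
from fractions import Fraction


def level_sum(levels, k_beta, c1):
    """Sum over levels of (expected node count on the level) * k(beta)^level."""
    total = Fraction(0)
    for k in levels:
        node_count = k_beta ** (c1 - k)   # expected number of nodes of W on level k, Eq. (6.2)
        weight = k_beta ** k              # factor k(beta)^k carried by each such node (assumed form)
        total += node_count * weight      # product is k(beta)^{C_1}, independent of k
    return total


# Hypothetical values, chosen only for illustration.
k_beta = Fraction(2, 3)       # k(beta) = (EN)^{-1} with EN = 3/2
c1 = 5                        # C_1
levels = [380, 385, 390]      # some levels that are multiples of C_1

total = level_sum(levels, k_beta, c1)
```

The result equals the number of levels times $k(\beta)^{C_1}$, which is why only the cardinality of the set of levels enters the final bound.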


This in particular shows the finiteness of the left-hand side for each $t$ and this particular subset
$W$. Therefore, Lemma 5.2 applies to give the estimate of the first term in (5.2), and we can proceed
computing the sum over the mixed terms.


6.1. Calculations involving $W_{i,i',t}$.

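Lemma 6.2. There is $0 < \eta < \chi$ (independent of $t$), such that for all $t$ large enough

$$\mathbb{E}\Big[\sum_{i \in W}\ \sum_{i' \in W,\ |i'| \le |i|,\ i \ne i'} \mathbb{P}(W_{i,i',t})\Big] \le \frac{C_2 k(\beta)^{2C_1}}{e^{\eta C_1}} \cdot \frac{1}{t^\beta}.$$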


Proof. Step 1: We are going to reorganize the summation over $i, i' \in W$ by ordering them according
to their latest common ancestor, $i_0 := i \wedge i'$. Introducing as before the notation

$$W_{i_0,t} = \{i \in W : i > i_0\}, \quad |i \wedge i'| = m, \quad |i| = p, \quad |i'| = q,$$

the restrictions $i_0 = i \wedge i'$, $i, i' \in W$, $|i'| \le |i|$ translate to (we omit the restriction $i \ne i'$, since
we want an estimate from above anyway)

$$i, i' \in W_{i_0,t}, \quad \max\{m + C_1,\ n_t - \sqrt{n_t}\} \le p \le n_t - \sqrt{n_t}/2, \quad m \le q \le p.$$

Similar to Eq. (6.2), we compute for $l > m$, $l \in L_t$, the expected size of the $l$-th generation in
$W_{i_0,t}$ to be

$$\mathbb{E}[\#\{i \in W_{i_0,t} : |i| = l\}] = \mathbb{E}[\#\{i \in T : |i| = l - C_1,\ i|_m = i_0\}] = (\mathbb{E}N)^{l - m - C_1} = k(\beta)^{C_1 + m - l}.$$

Abbreviate the lower bound for $p$ by $p^* := \max\{m + C_1,\ n_t - \sqrt{n_t}\}$, and the upper bound by
$n^* := n_t - \sqrt{n_t}/2$. Then, using (5.6),


E 


< E 


< E 


C2k{pY k[PY- 


^ iGW i^GW,|i'|<|i| ,i^i^ 

E E E E E 

m<n* iGTiQ^t,|i|=p ^^9<P {i': io=iVi',|i'|=g} 

^m<n* p*<p<n* iGTiQ,t,|i|=p rn<q<p 


t^y/nf exip-^) _ 


t^y/^ ex(P-™) _ 
< E


E E E 

'-m<nl Pt<p<n* iGTi„,t.|i|=p 


C2k{/3)P+^^ {p — m) 


< 1 ( q\Ci+m-p C‘2ki/3)P+'^^ (p - m) 

- i/3^ ex(p-™) 

m<n* Pt^P'^'^t ^ 


C2 




E E 


gX(p-m) 


m<n* p* ^p^n* 

Now, we have to split the remaining summation, depending on which lower bound for $p$ we are going
to use. Before we do so, we note that since $p - m \ge C_1$, we can estimate

$$\frac{p - m}{e^{\chi(p - m)}} \le \frac{1}{e^{\eta(p - m)}}$$

for some $0 < \eta < \chi$, as soon as $C_1$ is large enough.
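How large $C_1$ has to be for this estimate can be made explicit. The inequality $x e^{-\chi x} \le e^{-\eta x}$ is equivalent to $\log x \le (\chi - \eta)x$, and since $\log(x)/x$ is decreasing for $x \ge e$, it suffices to find one integer $C_1 \ge 3$ satisfying it. The following sketch searches for such a value; the function name `sufficient_c1` and the sample values of $\chi$ and $\eta$ are hypothetical, and the returned value is a sufficient threshold, not necessarily the smallest possible one.

```python
import math


def sufficient_c1(chi, eta):
    """Return an integer c1 >= 3 with x * exp(-chi*x) <= exp(-eta*x) for all x >= c1."""
    if not 0 < eta < chi:
        raise ValueError("need 0 < eta < chi")
    gap = chi - eta
    c1 = 3                                # log(x)/x is decreasing for x >= e, so start at 3
    while math.log(c1) > gap * c1:        # equivalent to c1 > exp(gap * c1)
        c1 += 1
    return c1


# Hypothetical exponents, chosen only for illustration.
chi, eta = 0.5, 0.25
c1 = sufficient_c1(chi, eta)
```

Choosing $\eta$ closer to $\chi$ shrinks the gap $\chi - \eta$ and therefore forces a larger $C_1$.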


C 2 


< 


< 


C2 

t^V^t L 
C2 


E E 

m<n^ p*<p<’T'* 


^(^)m+ 2 Ci 


gXlp-m) 


E E 


fc(/3)™+2Ci 


m+Ci<nt — ^/rH nt — y/ru<p<r, 

^(^)m+ 2 Ci 


gp(p-m) 


E 


E E 

nt-^/rH<m+Ci<n1 m+Ci<p<ra* 

fc(/3)’"+2Ci 


k{l3)'^+^Ci 

gp(p-m) 


E 


gP[Ci+nt-yTiT-(m+Ci)] Z—i 

m+C\<nt — y/^ rit —y'ru<iTi+Ci<Tit 


G2fc(/3)2Ci 

1 1 y/rit 1 

G2fc(/3)2Ci 

1 1' 


gpCi gTji 2 

L 1=0 ^ 


x/Rt(l - e P) 2 


< 


For t large enough, the factor in the brackets becomes smaller than one, and we obtain the assertion. 

□ 


7. Proof of the main theorem 

The proof is just a consequence of the previous results. Lemma 5.2 provides lower estimates of
$\mathbb{P}(\langle u, X \rangle > t)$, that is (5.2), in terms of a subset $W$ of $T$ and probabilities $\mathbb{P}(V_{i,t})$, $\mathbb{P}(W_{i,i',t})$ for
$i, i' \in W$. Estimates of those probabilities were given in Lemmas 5.3 and 5.4, and the set $W$ was
defined at the beginning of Section 6. In view of Lemmas 6.1 and 6.2 we obtain

P((„,X)>() > 

Finally, choosing $C_1$ so large that the last constant is positive, we conclude the result.
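The choice of $C_1$ can be illustrated numerically. The sketch below assumes that the final constant is the lower bound of Lemma 6.1 minus the upper bound of Lemma 6.2, that is $D_1 k(\beta)^{C_1}/(2C_1) - C_2 k(\beta)^{2C_1} e^{-\eta C_1}$; the exact form delivered by Lemma 5.2 may differ by further constants. The function names and all numerical values are hypothetical.

```python
import math


def final_constant(c1, d1, c2, k_beta, eta):
    """Assumed form of the constant: Lemma 6.1 bound minus Lemma 6.2 bound."""
    first_term = d1 * k_beta ** c1 / (2 * c1)
    mixed_terms = c2 * k_beta ** (2 * c1) * math.exp(-eta * c1)
    return first_term - mixed_terms


def first_positive_c1(d1, c2, k_beta, eta, max_c1=10_000):
    """Smallest c1 in 1..max_c1 making the assumed constant positive, or None."""
    for c1 in range(1, max_c1 + 1):
        if final_constant(c1, d1, c2, k_beta, eta) > 0:
            return c1
    return None


# Hypothetical constants, chosen only for illustration.
d1, c2, k_beta, eta = 1.0, 50.0, 0.5, 0.1
c1 = first_positive_c1(d1, c2, k_beta, eta)
```

Positivity is equivalent to $D_1/(2C_1) > C_2 (k(\beta) e^{-\eta})^{C_1}$. Since $k(\beta) e^{-\eta} < 1$, the right-hand side decays geometrically while the left-hand side decays only like $1/C_1$, so a suitable $C_1$ always exists under this assumed form.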